Movie-DiC: a Movie Dialogue Corpus for Research and Development

نویسنده

  • Rafael E. Banchs
چکیده

This paper describes Movie-DiC a Movie Dialogue Corpus recently collected for research and development purposes. The collected dataset comprises 132,229 dialogues containing a total of 764,146 turns that have been extracted from 753 movies. Details on how the data collection has been created and how it is structured are provided along with its main statistics and characteristics.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A User Simulator for Task-Completion Dialogues

Despite widespread interests in reinforcement-learning for task-oriented dialogue systems, several obstacles can frustrate research and development progress. First, reinforcement learners typically require interaction with the environment, so conventional dialogue corpora cannot be used directly. Second, each task presents specific challenges, requiring separate corpus of task-specific annotate...

متن کامل

Modeling a Dialogue Strategy for Personalized Movie Recommendations

This paper addresses conversational interaction in useradaptive recommender systems. By collecting and analyzing a movie recommendation dialogue corpus, two initiative types that need to be accommodated in a conversational recommender dialogue system are identified. The initiative types are modeled in a dialogue strategy suitable for implementation. The approach is exemplified by the MADFILM mo...

متن کامل

Pure and Embedded Film Genres on the Movie Comprehension of Iranian EFL Learners

The present study investigated the effect of single and multiple genre movies on listening comprehension of upper intermediate language learners. To this end, a language proficiency test was administered to 40 male and female postgraduates of different majors attending International English Language Testing System (IELTS) classes and ultimately 25 upper intermediate language learners were selec...

متن کامل

Scene Boundary Detection from Movie Dialogue: A Genetic Algorithm Approach

Movie scripts are a rich textual resource that can be tapped for movie content analysis. This article describes a mechanism for fragmenting a sequence of movie script dialogue into scene-wise groups. In other words, it attempts to locate scene transitions using information acquired from a sequence of dialogue units. We collect movie scripts from a web archive. Thereafter, we preprocess them to ...

متن کامل

A psychological analysis of the movie Under the Smokey Roof (2017) based on the family therapy theories

Movies are considered an effective educational resource for students, especially those who study Psychology. The purpose of this study is to analyze the movie "Under the Smokey Roof" directed by Pouran Derakhshandeh, based on the family therapy theories. This movie shows the story of a family struggling with different social and psychological issues. In this article, a descriptive-analytical me...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012